Method for making insertional mutations using a Tn5 synaptic complex

ABSTRACT

A method for making insertional mutations at random or quasi-random locations in the chromosomal or extra-chromosomal nucleic acid of a target cell includes the step of combining, in the target cell, cellular nucleic acid with a synaptic complex that comprises (a) a transposase protein and (b) a polynucleotide that comprises (1) a pair of nucleotide sequences adapted for operably interacting with Tn5 transposase and (2) a transposable nucleotide sequence therebetween, under conditions that mediate transpositions into the cellular DNA. In the method, the synaptic complex is formed in vitro under conditions that disfavor or prevent the synaptic complexes from undergoing productive transposition.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation-in-part of application Ser. No. 09/159,363, filed Sep. 23, 1998, U.S. Pat. No. 6,159,736, which application is incorporated herein by reference as if set forth in its entirety.

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT

Not applicable.

BACKGROUND OF THE INVENTION

Efficient insertion of exogenous nucleic acid into the chromosomal and extra-chromosomal nucleic acid of cells is desired in the art of molecular biology to identify chromosomal regions involved in expressing or regulating expression of peptides and proteins or to insert a nucleotide sequence of interest such as a primer binding site into a polynucleotide. This same technology is also advantageously used in developing new therapeutic and pharmacologic agents.

One common method relies upon in vivo Tn5 mutagenesis to insert polynucleotides of interest into cellular DNA and to construct libraries of cells that contain inserted polynucleotides at random or quasi-random locations. Existing in vivo Tn5 mutagenesis methods require target cells to encode transposase, either natively or from an introduced expression construct. Accordingly, it can be necessary to construct a suitable expression system appropriate to each target cell type. This can be time consuming, and requires extensive knowledge of the requirements of each target cell type.

In many cases, the gene that encodes transposase is encoded by an active transposon, which can continue to transpose in a target cell after the initial desired mutagenesis step. Such undesired residual transposition is undesired in that it complicates the analysis of insertional mutant libraries.

Furthermore, many techniques for in vivo Tn5 mutagenesis rely upon a complex biological mechanism for introducing exogenous DNA into the target cells, such as bacteriophage lambda transducing phage or a conjugating plasmid. It would be desirable to avoid requiring such complex biological systems.

The nature of natural Mu DNA, its ends, and its role in transposition are known. The left end includes 3 att repeat sites, denoted L1, L2 and L3 (all parallel). At the left end, only L1 is involved in transpososome formation. The nucleotide sequence of L1 is 5′-TGTATTGATTCACTTGAAGTACGAAAAAAA-3′ (SEQ ID NO:1). L2 and L3 are spaced significantly apart from L1. The right end includes att repeat sites R1, R2 and R3. R1 and R2 are located close to one another and are involved in complex formation. The nucleotide sequence of R1 is 5′-TGAAGCGGCGCACGAAAAACGCGAAAGCGT-3′ (SEQ ID NO:2). The nucleotide sequence of R2is 5′-GAAAGCGTTTCACGATAAATGCGAAAACTT-3′ (SEQ ID NO:3). R3 and L1 are inverted relative to R1 and R2. MuA transposase, encoded by phage Mu, can bind to all 6 att repeat sites, but the MuA tetramer in a transpososome footprints on only L1, R₁ and R2.

The Mu transposition reaction is most efficient when the Mu DNA includes one left end (containing L1, L2 and L3) and one right end (containing R1, R2 and R3). If the transposon ends are precut, strand transfer is most efficient if two right ends (containing R1 and R2) are used. If the non-transferred strand has a few (1 to 16 are equally effective) extra bases at the 5′ end, then the reaction is even more efficient.

A commercial system for inserting a selectable artificial Mu transposon into target DNA is available from Finnzymes Oy and is based upon the above-noted considerations. In the commercial Mu system, MuA transposase and the target DNA are mixed together ex vivo to form products that have a polynucleotide inserted into target DNA. The precut transposon ends of non-transferred strands are provided with four extra bases. Both transposon ends are right ends that include R1 and R2, with the final TT of R2 absent from the construct. The sequence of the right ends used in the commercial product, presented as SEQ ID NO:4, is:

gatcTGAAGCGGCGCACGAAAAACGCGAAAGCGTTTCACGATAAATGCGAAAAC 3′-ACTTCGCCGCGTGCTTTTTGCGCTTTGCCAAAGTGCTATTTACGCTTTTG-5′

The transposition products are subsequently delivered (e.g., by electroporation) into competent target cells. Cells that contain such extra-chromosomal transposition products are then selected using well-known methods. Transposition is completed outside of the target cell. In the commercial embodiment, the Mu transposon does not transpose into the DNA of the target cell.

Shoji-Tanaka, A., et al., B.B.R.C. 203:1756-1764 (1994) describe using purified retroviral integrase to mediate gene transfer into murine cells. Kuspa, A. and W. F. Loomis, P.N.A.S. U.S.A. 89:8803-8807 (1992) and others have described specifically integrating a plasmid linearized with a restriction enzyme into a genomic restriction site by electroporating enzyme-cut nucleic acid along with the cleaving enzyme into target cells.

BRIEF SUMMARY OF THE INVENTION

The present invention is summarized in that a method for efficiently inserting a transposable polynucleotide at random or quasi-random locations in the chromosomal or extra-chromosomal nucleic acid of a target cell includes the step of combining, in the target cell, cellular nucleic acid with a synaptic complex that comprises (a) a transposase protein complexed with (b) a polynucleotide that comprises (1) a pair of nucleotide sequences adapted for operably interacting with the transposase and (2) a transposable polynucleotide therebetween, under conditions that mediate transpositions into the cellular DNA. In the method, the synaptic complex is formed in vitro under conditions that disfavor or prevent the synaptic complexes, which are poised for transposition, from actually undergoing productive transposition. Synaptic complexes are then delivered into a target cell. In the target cell, the transposable polynucleotide is transposed into cellular target nucleic acid to form productive transposition products. Cells containing productive transposition products can be identified using a selection method such as antibiotic resistance.

The frequency of productive transposition of the transposable polynucleotide into the target nucleic acid can be enhanced by using in the method either a hyperactive transposase or a polynucleotide that comprises nucleotide sequences particularly well adapted for efficient transposition in the presence of the transposase, or both.

The present invention is further summarized in that a method for forming a library of cells that comprise insertional mutations includes the steps of combining in a plurality of target cells the cellular nucleic acid with the synaptic complex as described, and screening for cells that comprise insertional mutations.

In another aspect, the invention is further summarized as a library of cells that comprise insertional mutations formed according to the above-mentioned method. Such populations of cells that comprise random (or quasi-random) and independent mutational insertions in their genomes can be screened to select those cells that comprise an insertional mutation that induces a phenotypic or genotypic change relative to cells that were not subject to insertional mutagenesis.

It is an advantage of the present invention that the transposable polynucleotides used to form synaptic complexes can consist of transposon DNA apart from any flanking donor backbone (DBB) sequences. This is advantageous in that it reduces the likelihood of intramolecular transposition and increases the likelihood of transposition into a target genome. Moreover, eliminating DBB sequences from the polynucleotide simplifies preparation of the transposon sequences that can be used in the method.

It is another advantage of the present invention that the synaptic complex can form under conditions that disfavor non-productive intramolecular transposition events. This is advantageous in that substantially all of the synaptic complexes can undergo transposition when combined with the cellular DNA. Little, if any, of the nucleic acid in the synaptic complexes is inactive.

It is a feature of the present invention that transposition-promoting conditions are encountered only after the synaptic complex is in the presence of the target nucleic acid in the target cell.

Other objects, advantages, and features of the present invention will become apparent upon consideration of the following detailed description.

BRIEF DESCRIPTION OF THE SEVEPAL VIEWS OF THE DRAWINGS

FIG. 1 depicts the transfer of a synaptic complex into a target cell followed by selection of insertion mutants.

FIG. 2 a preferred transposase enzyme for use in a first embodiment of the method.

FIG. 3 depicts transposon terminal sequences that can be used in the transposable polynucleotide sequence in the first embodiment of the present invention.

FIG. 4 depicts a transposon or transposable polynucleotide sequence having mosaic ends and a transposable nucleic acid sequence therebetween.

DETAILED DESCRIPTION OF THE INVENTION

In a system for in vitro transposition using modified Tn5 transposase described in International Application No. PCT/US97/15941 (International Publication No. WO 98/10077), incorporated herein by reference in its entirety, the applicants demonstrated productive transposition of a transposable polynucleotide in vitro using a modified mutant Tn5 transposase and modified Tn5 polynucleotide sequences flanking a transposable nucleotide sequence.

A synaptic complex structure is an intermediate in the transposition of the transposable polynucleotide into the cellular nucleic acid. It is herein disclosed that one can separate synaptic complex formation from productive transposition by interfering with the ability of the synaptic complex to participate in a polynucleotide strand transfer reaction. In the method, the synaptic complex is formed in vitro under conditions that disfavor or prevent the synaptic complexes from undergoing productive transposition.

The invention can expand the use of transposon mutagenesis methods to, for example, bacteria that are not close relatives of E. coli and for which little is known about the structures of, and other requirements for, promoters and other regulatory sequences. In such cells, transposase may not be expressed in adequate amount, or may be unstable. The present method avoids the need to produce an endogenous transposase molecule. Moreover, since the introduced transposase is integrally complexed with the transposon polynucleotide, it is poised to act in the presence of any target DNA and, as such is sufficiently stable to facilitate transposition of the polynucleotide with which it is complexed. The transposase enzyme is preferably sufficiently free of other proteins and other factors that can inhibit the ability of the enzyme to complex with the donor polynucleotide or to catalyze transposition in the target cell. Suitable synaptic complexes can be formed between any transposase enzyme and compatible donor nucleic acid sequences in accordance with the invention.

One method discovered by the inventors to prevent the synaptic complexes from undergoing polynucleotide strand transfer is achieved by reducing or eliminating magnesium ions (Mg⁺⁺) from the reaction mixture. Thus, in a preferred embodiment of the method, a suitable transposase and a suitable transposable DNA element are combined in an in vitro reaction where the reaction mixture contains magnesium at a level insufficient to support polynucleotide strand transfer, or more preferably, a mixture free of magnesium ions. A suitable reaction buffer for forming the synaptic complexes can be the reaction buffer described on page 19 of the incorporated international application, modified to remove the magnesium acetate. One can also remove the BSA and spermidine from the reaction buffer. The tRNA can be eliminated from the buffer, unless nucleases are present in the reaction, in which case the tRNA can be added. It may be possible to further simplify the reaction buffer. Typical reactions are described below in the examples.

The transposase in the synaptic complexes can be a transposase that forms synaptic complexes efficiently in vitro (e.g., about 25% or more of the transposon DNA is converted into synaptic complexes with the transposase).

The transposable polynucleotide is preferably a linear polynucleotide comprising at its termini the inverted polynucleotide repeat sequences required for transposition. It is also possible, but less preferred, to employ a transposable polynucleotide that is linear but which comprises sequences in addition to the transposable nucleic acid and the terminal polynucleotide repeat sequences. Such additional sequences can be located to either side of the transposable polynucleotide, although this arrangement is less preferred because it adds extra steps to the transposition process. Some transposition can occur when the transposable polynucleotide is provided on a circular, supercoiled DNA molecule, although the transposition frequency is not as high as when the transposable sequence is a linear molecule.

The transposable nucleic acid sequence between the terminal repeat sequences can include any sequence that is desirably inserted into a target genome. FIG. 4 shows a schematic transposon having so-called mosaic ends and a transposable polynucleotide sequence therebetween. One skilled in the art can readily construct a desired transposable nucleic acid sequence to accomplish a particular goal. The invention is not limited to any particular nucleic acids between the inverted polynucleotide sequences. Rather, the sequence can be any detectable sequence, or a sequence that encodes a product that can be detected using methods known to those skilled in the art. By way of non-limiting example, the transposable nucleic acid sequence can provide the target cell with a selectable marker, which can be a peptide or protein encoded by the transposable nucleic acid sequence. The sequence can encode a protein that confers antibiotic resistance to a cell. Alternatively, the transposable polynucleotide can comprise a sequence whose presence can be detected in target cells. Such a sequence can include a cleavage sequence for a rare restriction enzyme, or any sequence for which a probe exists.

The transposable polynucleotide can also include sequences that regulate expression (transcription or translation) of nearby sequences. The regulatory sequences can facilitate expression of a protein or peptide encoded by a coding sequence in the transposable polynucleotide. Alternatively, the transposable polynucleotide can include regulatory sequences apart from any coding sequence or coding sequences apart from any regulatory sequence. In the former case, transposition events upstream from genomic coding sequences can provide the regulatory elements necessary to modulate transcription and/or translation of an endogenous coding sequence. In the latter case, the transposed sequence can reveal previously unknown regulatory sequences in the genome by introducing a marker gene whose product is produced only when the gene is transposed to a position adjacent to a regulatory sequence in the cellular nucleic acid.

Once formed from the above-identified components in vitro, the synaptic complex can be introduced into target cells using methods known to those of ordinary skill in the art. The synaptic complexes can alternatively and advantageously be stored in advance of use, for example, at −40° C. A preferred method for introducing the synaptic complex into the cell is electroporation, for example, using the example of Dower, W. J. et al, Nucleic Acids Research, 16:6127 (1988), incorporated herein by reference in its entirety. Transposition follows delivery into the cellular nucleic acid without additional intervention. If the synaptic complexes are present in a volume greater than a few microliters, it is also preferred to dialyze the synaptic complexes against a low-salt buffer, e.g., without limitation, 6 mM Tris, pH 7.5 with 7-10% glycerol, before use in an electroporation reaction to reduce salts in the reaction mixture to a level sufficiently low that they do not cause arcing between the electrodes. Other suitable methods for introducing the synaptic complexes into target cells are known and can include transformation-, transfection-, and liposome-mediated methods.

By introducing the synaptic complexes into a plurality of suitable target cells and selecting those cells that comprise insertional mutations, a library of cells that comprise random or quasi-random insertional mutations can be produced. To prepare a library of insertional mutants, the transposable nucleic sequence preferably contains a convenient selectable marker such as a gene that confers antibiotic resistance, so that cells lacking an insertional mutation in their cellular DNA can be readily distinguished from those in which productive transposition has occurred. Libraries formed according to the method can be screened for genotypic or phenotypic changes after transposition. At the molecular genetic level, one can employ standard analytical methods including hybridization, restriction fragment mapping, nucleotide sequencing, and combinations thereof, or other methods to identify genetic changes. At the phenotypic level, one can evaluate members of a library of transposition mutants for individual mutants having an altered growth property or other phenotype.

A kit comprising synaptic complexes comprising transposable polynucleotide sequences having desired characteristics, such as those described, can be commercialized to facilitate rapid preparation of libraries comprising insertional mutations, wherein the inserted sequences have been designed to accomplish a particular goal, as described. The kit can also comprise cells into which the complexes can be introduced. The synaptic complexes of the present invention can be substantially free of polynucleotide molecules that have undergone productive transposition. Furthermore, because the synaptic complexes are prepared in an in vitro reaction, the complexes can be provided as a substantially pure preparation apart from other proteins, genetic material, and the like.

The present invention differs from prior systems in that in the present invention synaptic complexes are formed in vitro, although in vitro transposition is substantially absent. Rather, the synaptic complexes are introduced into target cells whereupon transposition in vivo readily occurs.

FIG. 1 presents a schematic view of one aspect of the present method for efficiently introducing insertional mutations at random or quasi-random locations in the chromosomal or extra-chromosomal nucleic acid of a target cell.

The system disclosed herein is flexible enough to be implemented using any combination of transposase enzyme and compatible donor polynucleotide having suitable ends. Combinations of transposases and compatible sequences adapted for operably interacting with the transposases include Tn5 transposase and Tn5 end DNAs as well as Mu transposase and Mu end DNAs. Other combinations that appear to be similarly adapted include Mariner transposase and Mariner end DNAs as well as Tn522 transposase and Tn522 end DNAs. Since the present invention provides a complex that readily transposes in the presence of target DNA no additional biological activities are required after complex formation.

In a first exemplified non-limiting embodiment, cellular nucleic acid in a target cell is combined with a synaptic complex that comprises (a) a Tn5 transposase protein (shown as a pair of adjacent circles in FIG. 1) and (b) a polynucleotide that comprises a pair of inverted nucleotide sequences (shown as arrows in FIG. 1) adapted for operably interacting with Tn5 transposase and a transposable nucleotide sequence therebetween, under conditions that mediate transpositions into the cellular DNA.

In this embodiment, the transposase can be a hyperactive Tn5 transposase such as that disclosed in International Application No. PCT/US97/15941. A preferred mutant Tn5 transposase is a mutant Tn5 transposase modified relative to a wild-type Tn5 transposase, the mutant transposase comprising a mutation at position 54, and a mutation at position 372, the mutant transposase having greater avidity for Tn5 outside end repeat sequences of a donor DNA and a lesser ability to form non-productive multimers than wild-type Tn5 transposase. A mutation at position 54 that confers a greater avidity for Tn5 outside end repeat sequences is a mutation from wild-type glutamic acid to lysine. A mutation at position 372 that causes a reduced ability to form non-productive multimers is a mutation from wild-type lysine to proline, which may facilitate a conformational change wherein the N and C termini move apart and allow DNA binding.

It is also preferred that Tn5 transposase be free of the so-called inhibitor protein, a protein encoded in partially overlapping sequence with the transposase which can interfere with transposase activity. In the method, the transposase is used in purified, or partially purified, form, and if the transposase enzyme is obtained from cells (using conventional methods) it may be possible to separate the transposase from the inhibitor protein before use in the method. However, it is also possible to genetically eliminate the possibility of having any contaminating inhibitor protein present by simply removing its start codon from the gene that encodes the transposase.

An AUG in the wild-type Tn5 transposase gene that encodes methionine at tranposase amino acid 56 is the first codon of the inhibitor protein. However, it has already been shown that replacement of the methionine at position 56 has no apparent effect upon the transposase activity, but at the same time prevents translation of the inhibitor protein, thus resulting in a somewhat higher transposition rate. Wiegard, T. W. and W. S. Reznikoff, “Characterization of Two Hypertransposing Tn5 Mutants,” J. Bact. 174:1229-1239 (1992), incorporated herein by reference. In particular, the present inventors have replaced the methionine with alanine in the preferred embodiment (and have replaced the methionine-encoding AUG codon with an alanine-encoding GCC). A preferred transposase of the present invention therefore includes an amino acid other than methionine at amino acid position 56, although this change can be considered merely technically advantageous (since it ensures the absence of the inhibitor protein from the in vitro system) and not essential to the invention (since other means can be used to eliminate the inhibitor protein from the in vitro system). Shown schematically in FIG. 2 is a preferred transposase enzyme having mutations at positions 54, 56, and 372 relative to wild type Tn5 transposase.

The transposable polynucleotide in the synaptic complexes of the first embodiment is characterized as a nucleotide sequence flanked by a pair of inverted polynucleotides that comprise an 18 or 19 base long sequence that can function in an in vitro Tn5 transposition system. The polynucleotide (or any portion thereof) can be synthesized using methods well known to those skilled in the art, or can be prepared using methods for genetically engineering nucleic acid fragments. Referring now to FIG. 3, the known nucleic acid sequence of an outside end (OE) terminal sequence of a wild-type Tn5 transposon sequence (5′-CTGACTCTTATACACAAGT-3′; SEQ ID NO: 5) can be used as the flanking polynucleotides, as can a terminus of an inside end (IE) terminal sequence of a wild-type Tn5 transposon sequence (5′-CTGTCTCTTGATCAGATCT-3′; SEQ ID NO: 6). Although the wild-type OE terminus sequence can be used, the inventors have demonstrated that transposition frequencies at least as high as, and typically significantly higher than, that of wild-type OE can be achieved if the termini in a construct are mosaics intermediate between OE and IE sequences. Preferred mosaic terminus sequences include bases ATA at positions 10, 11, and 12, respectively, as well as the nucleotides in common between wild-type OE and IE (e.g., at positions 1-3, 5-9, 13, 14, 16, and optionally 19). The nucleotides at position 4, 15, 17, and 18 can correspond to the nucleotides found in those positions in either wild-type OE or wild-type IE. It is noted that the transposition frequency can be enhanced over that of wild-type OE if the nucleotide at position 4 is a T, when in combination with the above-noted nucleotides. Preferred mosaic sequences include CTGTCTCTTATACACATCT (SEQ ID NO: 7) and CTGTCTCTTATACAGATCT (SEQ ID NO: 8). Combinations of non-identical ends can also be employed.

In a second embodiment of the method, synaptic complexes are formed outside of a target cell by combining an artificial Mu transposon having bacteriophage Mu repeats at its termini with a Mu transposase (e.g., MuA protein). As in the first embodiment, the complexes are introduced into target cells, e.g., by electroporation or other method suitable for delivering nucleic acid-protein complexes. A preferred transposase is the 663 amino acid MuA protein encoded by ORF 1328-3319 of bacteriophage Mu, the sequence of which is reported in GenBank at locus AF083977 (version AF083977.1 G1:6010378) by Morgan, G. et al., incorporated herein by reference. this enzyme is commercially available from Finnzymes Oy. The preferred Mu ends are the right ends used in the product commercially available from Finnzymes Oy noted above which include a short single-stranded gatc tail added to the non-transferred strand.

The system provides a method for introducing mutations into cells quickly and efficiently. The two working embodiments of the invention, having their bases in distinct biological systems, demonstrate that the system can productively accommodate diverse synaptic complexes. Because the transposable polynucleotide sequence is provided, the system is not host-specific, and should work in any target cell. The invention finds particular utility in cells, particularly bacterial cells, that do not encode transposase, since the required transposase molecule is provided directly as part of the synaptic complex. The method is demonstrated to work in non-nucleated target systems, such as bacterial cells. In particular, the synaptic complexes have been introduced into E. coli cells (strain MG1655 is preferred because the sequence of its chromosome is known) and productive transposition has been observed. No scientific impediment is known to exist that would prevent use of the method in nucleated cells, such as archaebacteria, plant and animal cells, especially cells of nematodes, amphibians, and mammals, including, but not limited to, rodents and humans. In methods for introducing synaptic complexes into nucleated cells, it may be preferable to provide a nuclear localization signal on the synaptic complex, preferably as part of a genetically modified transposase protein.

The present invention will be more readily understood upon consideration of the following examples which are exemplary and are not intended to limit the scope of the invention.

EXAMPLES

In a reaction of 1 μl, 0.05 μg of a purified hyperactive transposase (EK45/MA56/LP372), the amino acid sequence of which is reported in incorporated International Application No. PCTJUS97/15941, was combined with 0.1 μg of a transposable polynucleotide comprising an expression cassette that encodes a protein that confers kanamycin resistance upon a target cell. The expression cassette was flanked by the mosaic ends described in the same incorporated International Application and shown in FIG. 3. The polynucleotide was provided, in separate reactions, as a supercoiled plasmid, a linearized plasmid, or as a polynucleotide fragment comprising at its termini inverted sequences required for transposition mediated by Tn5 transposase.

The mixture was incubated for one hour in the presence or absence of magnesium ions (Mg⁺⁺) in the reaction buffer noted above. In the absence of magnesium ions, synaptic complexes form, but no transposition occurs in vitro.

After incubation, the reaction mixture was mixed with 40 μl of E. coli strain MG1655 and subjected to electroporation according to the incorporated method of Dauer. The cells (1.4×10⁹) were plated on LB-Kan plates and kanamycin resistant colonies were counted. When 10 μl of reaction mix was used, the mix was dialyzed into a suitable buffer to prevent arcing between the electrodes.

TABLE 1 Tn5 Synaptic Complex electroporation/transposition DNA Tnp Mg⁺⁺ volume Kan⁴ CPU supercoiled* − + 1 μ1 None supercoiled* + + 1 μ1 1.6 × 10³ linearized* + + 1 μ1 6.9 × 10³ released element + + 1 μ1 3.9 × 10³ (PvuII cut) released element + − 1 μ1 5.3 × 10⁴ released element + − 1 μ1 (dialyzed) 4.0 × 10⁴ released element + − 10 μ1 (dialyzed)  5.5 × 10⁵ *contained donor backbone sequences

It is apparent from the table that transposition of the transposable polynucleotide that confers kanamycin resistance upon a cell is high when the reaction mixture is incubated in the absence of magnesium ions. Superior transposition was observed when the transposable polynucleotide contains no sequences flanking the transposable portion (“released element”). A greater than fold reduction in kanamycin resistant CFUs was observed when magnesium was added to the reaction mixture, presumably as a result of non-productive intramolecular transposition that can occur in the presence of magnesium ions.

In constructing a library, to achieve a 99% probability of complete coverage (i.e., an insert into each ORF) approximately 20,000 transposition events are required. The assumptions in this calculation are that transposition is truly random with respect to ORF target choice and that each ORF is the same size. Although Tn5 inserts have been found in all genes for which they have been sought, Tn5 does have target sequence biases that can skew the randomness in terms of gene distribution. There may also be unknown biases imposed on transposition by the condensation/organization of the nucleoid body. In addition, all ORFs are not the same size, but range in size in E. coli, for example, from under 100 codons to 2,383 codons. On the other hand, the results of synaptic complex electroporation/transposition described above represent inserts into dispensable functions only, since only viable colony forming units are formed. This clearly under represents the entire transposition insertion pool. Accordingly, to form a fully representative library, pools of between 50,000 and 500,000 viable transposition events should be sought. The invention described herein is sufficiently efficient to produce a library of this size.

In a second trial, MuA transposase was purchased from a commercial source. The transposase was mixed with a donor polynucleotide that included a sequence that encodes chloramphenicol resistance in E. coli flanked at both ends by Mu ends. In a typical reaction 1 μl of MuA (0.22 μg) and 3 Il of transposon, Cm^(R) (60 ng) were brought to a total volume of Al with magnesium-free buffer and were incubated for one hour at 30° C. Synaptic complexes were formed in the reaction. One μl of the incubated sample containing synaptic complexes was electroporated as above into chloramphenicol-sensitive electroporation-competent E. coli cells that were then plated and incubated. Eight chloramphenicol-resistant colonies were selected. MC1061 are suitable cells since they are efficient electroporation targets; other cells can be used, although it may be necessary to adjust the experimental parameters such as buffer, time, temperature, concentration of components. Chromosomal regions that included sequences from the polynucleotide were cloned and the nucleotide sequences of the cloned sequences were determined. Sequence data revealed that the donor polynucleotide was integrated into the host chromosome. A characteristic five base pair duplication of the end sequences from the Mu termini was observed.

The present invention is not intended to be limited to the foregoing, but to encompass all such variations and modifications as come within the scope of the appended claims.

8 1 30 DNA Bacteriophage Mu 1 tgtattgatt cacttgaagt acgaaaaaaa 30 2 30 DNA Bacteriophage Mu 2 tgaagcggcg cacgaaaaac gcgaaagcgt 30 3 30 DNA Bacteriophage Mu 3 gaaagcgttt cacgataaat gcgaaaactt 30 4 50 DNA Bacteriophage Mu misc_feature (1) single stranded tail (gatc, not shown) added to 5′ end of non-transferred strand of mu end. 4 tgaagcggcg cacgaaaaac gcgaaagcgt ttcacgataa atgcgaaaac 50 5 19 DNA Transposon Tn5 5 ctgactctta tacacaagt 19 6 19 DNA Transposon Tn5 6 ctgtctcttg atcagatct 19 7 19 DNA Artificial Sequence Description of Artificial Sequence Mosaic Sequence between Tn5 OE and IE sequences 7 ctgtctctta tacacatct 19 8 19 DNA Artificial Sequence Description of Artificial Sequence Mosaic sequence between Tn5 OE and IE sequences 8 ctgtctctta tacagatct 19 

We claim:
 1. A method for making an insertional mutation at a random or quasi-random position in cellular nucleic acid in a target cell, the method comprising the step of: introducing into the target cell a synaptic complex that comprises (a) a transposase protein and (b) a polynucleotide that comprises a pair of nucleotide sequences adapted for operably interacting with the transposase to form a synaptic complex and a transposable nucleotide sequence therebetween, under conditions that mediate transpositions into the cellular nucleic acid.
 2. A method as claimed in claim 1 wherein the method further comprises the step of: combining the transposase protein and the polynucleotide in vitro under conditions that disfavor polynucleotide strand transfer to form the synaptic complex.
 3. A method as claimed in claim 2 wherein the transposase protein and the polynucleotide are combined in vitro in a reaction that comprises magnesium ions at a level insufficient to support polynucleotide strand transfer.
 4. A method as claimed in claim 3 wherein the reaction lacks magnesium ions.
 5. A method as claimed in claim 1 wherein the transposase is MuA transposase.
 6. A method as claimed in claim 1 wherein the nucleotide sequence adapted for operably interacting with the transposase is the sequence of SEQ ID NO:4.
 7. A method for forming a synaptic complex between (a) a transposase protein and (b) a polynucleotide that comprises a pair of nucleotide sequences adapted for operably interacting with the transposase to form a synaptic complex and a transposable nucleotide sequence therebetween, the method comprising the step of combining (a) and (b) in vitro under conditions that disfavor polynucleotide strand transfer to form the synaptic complex.
 8. A method as claimed in claim 7 wherein the transposase protein and the polynucleotide are combined in vitro in a reaction that comprises magnesium ions at a level insufficient to support polynucleotide strand transfer.
 9. A method as claimed in claim 8 wherein the reaction lacks magnesium ions.
 10. A method for forming a library of insertional mutations at random or quasi-random positions in cellular nucleic acid in a plurality of target cells, the method comprising the steps of: introducing into the target cells a synaptic complex that comprises (a) a transposase protein and (b) a polynucleotide that comprises a pair of nucleotide sequences adapted for operably interacting with the transposase to form a synaptic complex and a transposable nucleotide sequence therebetween, the transposable nucleotide sequence comprising a selectable marker, under conditions that mediate transpositions into the cellular nucleic acid; and screening for cells that comprise the selectable marker. 